Synthetic Dataset Generation of Driver Telematics
نویسندگان
چکیده
This article describes the techniques employed in production of a synthetic dataset driver telematics emulated from similar real insurance dataset. The generated has 100,000 policies that included observations regarding driver’s claims experience, together with associated classical risk variables and telematics-related variables. work is aimed to produce resource can be used advance models assess risks for usage-based insurance. It follows three-stage process while using machine learning algorithms. In first stage, portfolio space feature applying an extended SMOTE algorithm. second stage simulating values number as multiple binary classifications feedforward neural networks. third aggregated amount regression networks, set resulting evaluated by comparing datasets when Poisson gamma are fitted respective data. Other visualization data summarization remarkable statistics between two datasets. We hope researchers interested obtaining calibrate or algorithms will find our ot valuable.
منابع مشابه
Warwick-JLR Driver Monitoring Dataset (DMD): A public Dataset for Driver Monitoring Research
Driving is a safety critical task that requires the full attention of the driver. Despite this, there are many distractions throughout a vehicle that can impose extra workload on the driver, diverting attention from the primary task of driving safely. If a vehicle is aware that the driver is currently under high workload, the vehicle functionality can be changed in order to minimize any further...
متن کاملGENERATION OF SYNTHETIC EARTHQUAKE RECORDS BY ARTIFICIAL INTELLIGENCE TECHNIQUES
For seismic resistant design of critical structures, a dynamic analysis, based on either response spectrum or time history is frequently required. Due to the lack of recorded data and randomness of earthquake ground motion that might be experienced by the structure under probable future earthquakes, it is usually difficult to obtain recorded data which fit the necessary parameters (e.g. soil ty...
متن کاملFramework for Generation of Synthetic Ground Truth Data for Driver Assistance Applications
High precision ground truth data is a very important factor for the development and evaluation of computer vision algorithms and especially for advanced driver assistance systems. Unfortunately, some types of data, like accurate optical flow and depth as well as pixel-wise semantic annotations are very difficult to obtain. In order to address this problem, in this paper we present a new framewo...
متن کاملDataset Generation for Meta-Learning
Meta-learning tries to improve the learning process by using knowledge about already completed learning tasks. Therefore, features of dataset, so-called meta-features, are used to represent datasets. These meta-features are used to create a model of the learning process. In order to make this model more predictive, sufficient training samples and, thereby, sufficient datasets are required. In t...
متن کاملUCOM offline dataset-an urdu handwritten dataset generation
A benchmark database for character recognition is an essential part for efficient and robust development. Unfortunately, there is no comprehensive handwritten dataset for Urdu language that would be used to compare the state of the art techniques in the field of optical character recognition. In this paper, we present a new and publically available dataset comprising 600 pages of handwritten Ur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Risks
سال: 2021
ISSN: ['2227-9091']
DOI: https://doi.org/10.3390/risks9040058